Abstract

Pictorial representations can play a pivotal role in both printed and digital learning material. Although there has 
been extensive research on cognitive techniques and strategies for learning from text, the same cannot be said for 
static and dynamic pictorial representations. In this paper we propose a systematic characterization of cognitive 
learning techniques that is founded on both theoretical and empirical research. The characterization relates the 
learning techniques to classes of cognitive processes as well as to textual and pictorial representations. We show 
how successful strategies for learning from both plain text and illustrated text are covered by the characterization. 
We also exemplify how the construction of new strategies for pictorial representations can be informed by the 
characterization. 

Keywords: pictorial representations, textual representations, cognitive processes, learning techniques, learning 
strategies 

1. Introduction 

In recent years, digital technology has revolutionized the production and use of educational materials. Firstly, 
conventional resources such as textbooks have become far more profusely illustrated than before due to the 
extensive cost savings made possible by the switch from analogue to digital production methods (cf. Pettersson, 
2008). Secondly, digitization of information has greatly facilitated the seamless combination of varied forms of 
learning media and allowed these combinations to be interactive. A feature of the resulting multimedia learning 
environments is that they are not restricted to using static pictures only - dynamic graphics such as animations 
now play a pivotal role in these resources. 

These fundamental changes in production and presentation in turn prompted a reconsideration of the way 
comprehension and learning processes were theorized. For instance, the concept of literacy was traditionally 
confined - as the term implies - to the production and comprehension of texts, i.e., verbal representations (e.g., 
Street, 1984). However, some more recent semiotic perspectives have sought to expand the concept of literacy to 
encompass other ways of representing information including pictorial and interactive portrayals (e.g., Doelker, 
1997; Kress, 2003; Kress & van Leeuwen, 2006; OECD, 2000). An alternative view (e.g., de Vries & Lowe, 
2010) maintains that the term graphicacy rather than literacy should be used for pictorial materials because of 
deep seated differences between the sign systems of textual and pictorial representations (cf. Schnotz, 2001). 
Furthermore, psychological theories of learning from textual representations (e.g., Kintsch & van Dijk, 1978) 
were broadened to include learning from combinations of texts and pictures (e.g., Mayer, 2009; Schnotz & 
Bannert, 2003). Last but not least, a considerable body of research on multimedia learning aims to design 
combinations of textual and pictorial representations in such a way that students’ processing of information is 
facilitated (for overviews see Clark & Mayer, 2008; Mayer, 2005a). The design of representations alone, 
however, cannot guarantee effective learning. Rather, it is only one side of the coin. The other side involves the 
cognitive processes that students actually apply to the learning material. These processes are manifested in the 
techniques and strategies that students use when learning from text and pictures. 

According to Streblow and Schiefele (2006), a learning strategy can be understood as “a sequence of 
efficient learning techniques, which are used in a goal-orientated and flexible way, for which processing is 
increasingly automated, but which are nevertheless consciously applied” (p. 353; translation by the authors). 
Learning techniques are hence the individual components that are collectively employed as an overall learning 
strategy. Examples of learning techniques include highlighting key phrases in a text, or marking important 
regions in a picture. Learning techniques constitute a learning strategy only when a number of them are 
coordinated in a goal-oriented way. The aim of applying a learning strategy is to induce, support, and sustain 
effective learning processes. 

Three broad classes of learning strategies are typically distinguished (e.g., Streblow & Schiefele, 2006): 
cognitive strategies, metacognitive strategies, and resource strategies. The aim of cognitive strategies is to foster 
effective primary information processing, whereas metacognitive strategies focus on the super-ordinate level of 
the planning, monitoring and regulation of cognitive processes. In contrast to these internally focused strategies, 
resource strategies focus on establishing supportive external learning environments (e.g., ensuring access to 
learning resources, avoiding distractions, etc.). The present paper concentrates on cognitive learning strategies 
such as organizational strategies, elaboration strategies, and rehearsal strategies (cf. Weinstein & Mayer, 1986; 
Wittrock, 1974, 1989) and their component techniques. 

Research on cognitive techniques and strategies for learning from textual representations has a long and 
extensive tradition that reaches back to the influential work of Robinson (1946). In stark contrast, techniques and 
strategies for learning from static and dynamic pictorial representations remain relatively neglected by 
researchers. This probably explains why techniques and strategies for learning from pictorial representations are 
rarely taught to students at school or pre-service teacher students at the university (cf. Kremling, 2008). The 
potentially damaging consequences of this neglect with respect to pictorial representations are exacerbated by 
educators who mistakenly assume that pictures are not only intrinsically beneficial but also self-explanatory (e.g., 
Lieber, 2008). Further, students who are asked to learn from pictures may experience “illusions of understanding” 
(cf. Salomon, 1983, 1984) without actually comprehending the pictures. Research has demonstrated that students can have 
severe difficulties in understanding static (e.g., Weidenmann, 1989, 2006) and dynamic pictures (e.g., Lowe, 
2003, 2004, 2008; Ploetzner, Bodemer, & Neudert, 2008; Ploetzner, Lippitsch, Galmbacher, Heuer, & Scherrer, 
2009). Thus, while present-day picture-rich learning materials used in schools and universities place many and 
varied demands on the students’ cognitive processes, students may lack the cognitive techniques and strategies 
needed to cope successfully with these requirements. A central goal of this paper is to provide a systematic and 
principled framework designed to help researchers construct additional cognitive techniques and strategies for 
learning from pictorial representations that complement those already developed for textual representations. 

This paper is made up of four main parts. In the first part, we note that while extensive research on cognitive 
techniques and strategies for learning from textual representations has been conducted, research on techniques 
and strategies for learning from static and dynamic pictorial representations is still sparse. In the second part, we 
summarize important theoretical models of learning from textual and pictorial representations. In the third part, 
we posit a systematic characterization of cognitive techniques for learning from textual and pictorial 
representations. The characterization builds on both (a) current theories and models of learning from different 
representations and (b) existing research on cognitive learning techniques and strategies. It makes evident broad 
deficits in the research on cognitive techniques for learning from pictorial representations. In the fourth part, we 
show how successful strategies for learning from text-based representations are covered by the characterization 
as well as how the generation of new techniques and strategies for pictorial representations can be informed by 
the characterization. The paper concludes by suggesting possible directions for future research on cognitive 
techniques and strategies for learning, particularly with respect to pictorial representations. 

2. Previous Research on Cognitive Techniques and Strategies for Learning from Textual and Pictorial 
Representations 

Early research on learning strategies was carried out at a time when written text was the dominant form of 
information representation. Unsurprisingly, it focused on textual representation with the main objective being to 
identify how successful and less successful learners differed in their strategic behavior during learning. Marton 
and Saljo (1984), for example, empirically identified two different approaches to learning from texts. They 
distinguished between a surface level approach, such as the repeated reciting of text in order to remember it, and 
a deep level approach, in which the text information is processed more intensively and elaborated in order to 
comprehend it. Successful learners were found to favor deep level approaches. Similar observations with respect 
to the depth of text processing have been made in other studies such as those by Dornisch, Sperling and Zeruth 
(2011), Pask (1976), Schlag, Florax and Ploetzner (2007), and Svensson (1977). 

Consistent with these findings, text learning strategies have been devised that aim to induce and promote 
beneficial elaboration and comprehension processes. For instance, building on the early work of F. P. Robinson 
(1946), Thomas and H. A. Robinson (1972) devised the well-known PQ4R-Method (Preview, Question, Read, 
Reflect, Recite, Review) that structures learning from text by means of six steps: (1) Survey the material to get a 
general overview (Preview), (2) formulate questions about the text (Question), (3) read the text thoroughly while 
keeping the formulated questions in mind (Read), (4) reflect on the text by relating the information to prior 
knowledge and formulating examples (Reflect), (5) answer the questions by giving an account of the text in 
one’s own words (Recite), and (6) try to recall or summarize the information that has been read without looking 
at the text (Review). Further examples of strategies for learning from texts are MURDER (Mood, Understanding, 
Recall, Digest, Expanding, Review) from Dansereau et al. (1979) and REDUTEX (Reduce Text) from Friedrich 
(1995). 

Current research on learning strategies is still geared almost exclusively to learning from texts (for overviews see 
Artelt, 2000; Gambrell, Morrow & Pressley, 2007; Mandl & Friedrich, 1992, 2006; Pressley & Harris, 2006). 
This is despite the fact that text no longer occupies the dominant position it once did in the educational landscape. 
Instead, today’s learning materials frequently contain a substantial proportion of pictures, very often more than 
the amount of text. In order to provide a form of support paralleling that available for text-only materials, 
researchers such as Larson et al. (1986), Schlag and Ploetzner (2009, 2011), and Seufert (2009) have proposed 
strategies for learning from illustrated texts. For instance, the strategy employed by Schlag and Ploetzner (2009, 
2011) is made up of a sequence of learning techniques that aim to address not only the text and pictorial 
components, but also the relations between these different types of representation. Two experimental studies 
showed that students who took advantage of the strategy exhibited significantly larger learning gains than 
students who wrote a summary of the learning material. Research by Metz and Wichert (2009) also showed that 
a similar strategy for learning from illustrated text produced beneficial outcomes. 

With respect to facilitating learning from static pictures, only single, isolated learning techniques - as opposed to 
learning strategies - have been proposed and empirically evaluated. For example, in order to encourage learners 
to process pictures more deeply, both Salomon (1984) and Weidenmann (1989) asked learners to pay special 
attention to the pictures while orienting themselves towards the learning material. Peeck (1994) challenged 
learners to create pictures of their own and asked learners to answer questions about the target pictures (see also 
Weidenmann, 1994). Weidenmann (1988) prompted learners to compare different pictures. Ainsworth and 
Loizou (2003) encouraged learners to explain the pictures to themselves. Although each of these individual 
techniques may have its own value in helping to support picture comprehension, on their own they do not 
constitute a comprehensive learning strategy. 

For dynamic pictures such as animations and simulations, research on learning strategies and learning techniques 
is particularly sparse. De Koning, Tabbers, Rikers and Paas (2010) asked learners to explain an animation to 
themselves. Hegarty, Kriz and Cate (2003) asked learners to express verbally relations between verbal and 
pictorial information presented in an animation. Moreno and Valdez (2005) supported learning from animation 
by directing low-achievers to organize segments of an animation into an appropriate sequence. High-achievers, 
in contrast, benefitted most from splitting an animation into meaningful segments by themselves. 
Schmidt-Weigand (2005) encouraged learners to stop an animation when the display showed important 
information and then analyze the display’s components as well as their relations. Gane and Catrambone (2006) 
posed questions asking learners for information about various aspects of the animated process. Taking advantage 
of de Jong’s (2005) concept of guided discovery learning, Huk and Ludwigs (2009) encouraged learners to 
self-explain an animated simulation. 

Until recently, research on learning from dynamic pictures also focused exclusively on individual learning 
techniques rather than investigating comprehensive learning strategies. However, Kombartzky, Ploetzner, Schlag 
and Metz (2010) have proposed a first strategy for fostering deep level learning from animation. The strategy 
encompasses a sequence of eight different techniques for learning from narrated animations. In two experimental 
studies, students who made use of the strategy learned significantly more than students who wrote a summary of 
the animated process. 

3. Theories of Learning from Textual and Pictorial Representations 

Current techniques and strategies for learning from textual and pictorial representations share an underlying 
objective: They all aim to foster deep level learning by inducing fundamental cognitive processes such as the 
identification, selection, organization, elaboration, transformation, and integration of information. Unsurprisingly, 
such processes also play a pivotal role in theories and models of learning from textual and pictorial 
representations. Influential examples are Kintsch and van Dijk’s (1978) theory of text comprehension, 
Wittrock’s (1974, 1989) theory of generative learning as well as the application of these theories to multimedia 
material by Mayer (2009; see also Mayer, 2005b) and Schnotz and Bannert (2003; see also Schnotz, 2005). 

In his theory of multimedia learning, Mayer (2009) characterizes multimedia material as combinations of spoken 
or printed texts and static or dynamic pictures. In accord with Atkinson and Shiffrin (1971) and Baddeley (1986), 
Mayer (2009) assumes that the human memory is divided into three sub-systems: the sensory registers, working 
memory, and long-term memory. Mayer (2009) formulates three basic assumptions concerning the working 
memory. First, the working memory is made up of both an auditory-verbal and a visual-pictorial channel 
(Baddeley, 1986; Paivio, 1986). Second, the capacity of the working memory is limited, i.e., only a limited 
amount of information can be processed simultaneously (Atkinson & Shiffrin, 1971; Baddeley, 1986). Third, 
successful learning from different representations requires an active processing and integration of the 
information presented (Wittrock, 1989). 

Mayer (2009) regards four types of cognitive processes as being important to successful multimedia learning: the 
selection, transformation, organization, and integration of information. The first step when learning from 
multimedia is to select relevant words from the available texts and relevant graphic entities from the available 
pictures. Subsequently, mental images may be transformed into auditory-verbal information by means of internal 
verbalizations and auditory-verbal information may be transformed into mental images by means of internal 
visualizations. Next, the material needs to be further processed in order to gain an understanding of the 
information and to retain it in long-term memory. For this purpose, it is necessary to organize the selected 
information. At first, this takes place separately for textual and pictorial information so that both a verbal model 
and a pictorial model are constructed. Thereafter, in order to store the information in the long-term memory, both 
the verbal and pictorial models, as well as already existing knowledge must be integrated by relating 
corresponding elements to each other. 

Models of multimedia learning are more concerned with how text-picture combinations are processed than with 
how the individual textual and pictorial components themselves are processed. Finer detail about how individual 
representations are processed can be found in the work of researchers such as Kintsch and van Dijk (1978; see 
also Kintsch, 1998) who provide a comprehensive characterization of text processing. However, the literature on 
how graphics are processed is much less extensive than it is for text. With respect to the comprehension of 
graphics such as pictures and diagrams, Weidenmann (1988), Pinker (1990), and Winn (1993) have made 
significant theoretical contributions. Weidenmann (1988) distinguishes between two different modes of 
understanding: ecological understanding and indicatorial understanding. Ecological understanding concerns the 
recognition of entities and scenes in a picture. In order to accomplish this task, Weidenmann (1988) assumes that 
learners apply the same processes to a picture that they do to visual perceptions of the natural environment. 
Depictive codes in the picture assist the learner in constructing the appropriate surface structures. Examples of 
depictive codes are the use of perspectives and colors. Indicatorial understanding, in contrast, concerns the 
identification and comprehension of visual arguments in a picture. In order to achieve this goal, the learner needs 
to interpret directive codes in the picture as indicators for arguments. Examples of directive codes are the use of 
accentuations, comparisons, enlargements, and directing symbols such as arrows. 

Numerous principles for the design of visual arguments have been developed (e.g., Tufte, 2005). Because 
instructional pictures invariably require the learner to extract visual arguments, they demand not only ecological 
but also indicatorial understanding. Many learners, however, process instructional pictures only ecologically (i.e., 
they apply a surface level approach) and not indicatorially (i.e., they do not make use of a deep level approach; 
cf. Weidenmann, 1989, 2006). 

Weidenmann (1988) further assumes that two different types of processes are essential for the understanding of 
instructional pictures: pre-attentive processes and attentive processes. While pre-attentive processes are made up 
of procedures that are automatically executed within fractions of a second and without conscious control, 
attentive processes encompass procedures that are consciously and intentionally executed over longer periods of 
time. By means of attentive processes, learners systematically search for information and draw conclusions from 
the acquired information. Whereas ecological understanding might be accomplished through pre-attentive 
processes, indicatorial understanding typically requires additional attentive processes. 

Weidenmann (1988) focuses on learning from realistic and schematic pictures (see also Weidenmann, 1994) in 
which there is a high degree of structural analogy between the pictures and the represented objects. Pinker 
(1990), in contrast, has proposed a model of how learners understand abstract graphics such as line diagrams, bar 
charts, and pie charts for which the structural correspondences are less direct. Like Weidenmann (1988), Pinker 
(1990) also distinguishes between pre-attentive and attentive processes when learning from a diagram. 
Perceptually-driven pre-attentive processes enable learners to first identify the basic elements of a diagram such 
as forms, positions, inclines, and angles. These elements are organized as perceptual groups which form an 
analog representation known as a visual array. Such visual arrays are subsequently encoded as symbolic 
representations in working memory. 

Pinker contends that perceived elements and their spatial relations are described by means of propositions within 
these representations. In the next step, the propositional representation is matched against schemata of diagrams 
in long-term memory. The schema that corresponds best to the propositional representation becomes instantiated 
with the information contained in the propositional representation. A schema is composed of both declarative 
and procedural knowledge about a specific type of diagram. While declarative knowledge encodes information 
about the geometrical features of a diagram, procedural knowledge represents how certain information can be 
inferred from a diagram. Finally, the instantiated schema is drawn upon in order to form conceptual relations, to 
initiate the retrieval of additional information from the diagram, and to infer information not explicitly 
represented in the diagram. Pinker’s (1990) model emphasizes the need for learners to acquire schemata of 
diagrams in order to be able to process diagrams effectively and efficiently. 

Winn (1993) also focuses on how viewers process diagrammatic information. While acknowledging that both 
pre-attentive and attentive processes are involved in extracting information from diagrams, he gives particular 
emphasis to the role of perception. In Winn’s account, pre-attentive processing is broken down into two 
complementary processes - discrimination in which a diagram’s constituent symbols are singled out as individual 
entities, and configuration in which those entities are grouped into larger assemblies. Perceptual issues such as 
visual contrast and the emergent properties of symbols are identified as fundamental to readers’ discrimination 
and configuration of the depiction’s component entities. These visual processes are not typically given 
commensurate consideration in accounts of text processing which tend to have a largely cognitive orientation. 

For diagram search to be effective, the contribution of bottom-up pre-attentive aspects such as perceptual 
salience must be complemented by top-down influences derived from knowledge of both (i) the symbol system 
together with its diagrammatic conventions, and (ii) the depicted content. These attentive contributions help 
guide diagram readers as to what they should look for and where they should look for it. In addition to 
supporting the selection of information, attentive processing on the basis of content knowledge schemas can help 
readers to organize the information they select into meaningful assemblies or chunks. 

Winn also suggests that the process of searching for information in diagrams can be cumulative in that multiple 
sweeps may need to be made in order to achieve a goal. The strategies that a reader uses to scan a diagram are 
influenced by the nature of that goal and the reading conventions assumed to be appropriate for the particular 
display’s structure. Although some strategies (such as partitioning a display into sections) are applicable across 
different types of diagram, their effectiveness in facilitating learning can depend on the reader’s knowledge of 
the depicted content. 

The most distinctive difference between dynamic representations such as animations and static representations is 
their temporal structure. However, research on how learners actually process animated displays has not kept pace 
with the rapidly expanding educational uses of animation. Systematic exploration of perceptual and cognitive 
processing factors that can influence learning from animation is a relatively recent development. A key 
contribution in this area was made by Kriz and Hegarty (2007) who identified the importance of both bottom-up 
and top-down influences in learning from animation. The interplay of these two influences is also a central pillar 
of the Animation Processing Model (APM) which is particularly concerned with how dynamics influence 
information extraction and integration (Lowe & Boucheix, 2008, 2011). The APM characterizes learning from 
animation in terms of five processing phases with a progression from a bottom-up to a top-down emphasis: (1) 
localized perceptual exploration, (2) regional structure formation, (3) global characterization, (4) functional 
differentiation, and (5) mental model consolidation. These phases involve iterative and cumulative processing of 
the externally presented dynamic portrayal that, if successful, results in the learner constructing a high quality 
mental model of the subject matter depicted in the animation. 

During localized perceptual exploration, the learner uses largely perceptually-based processing to parse the 
continuous flux of an animation into differentiated components. This parsing results in a set of event units that 
are the basis for the subsequent processing operations. Each event unit consists of an entity depicted in the 
animation plus its associated behavior. The process of isolating event units from the animation provides the 
building blocks from which a mental model of the to-be-learned subject matter will be constructed. 

During regional structure formation, the learner uses aspects such as gestalt relations between neighboring event 
units to connect this parsed material into small regional clusters of activity. However, the resulting dynamic 
micro-chunks are distributed across the animation in spatially and temporally separated locations. In order to 
develop a coherent internal representation from the individual dynamic micro chunks, the process of global 
characterization bridges these isolated islands of activity into broader relational structures. 

A key process involved in this inter-linking is the establishment of domain general causal chains that are not 
based on the specific functionality of the referent subject matter. The process of functional differentiation 
introduces such functionality by applying domain specific background knowledge to the relationships previously 
established and reinterpreting them where necessary. This allows the causal chains to be characterized in terms 
of the specific functional purpose of the content depicted in the animation. 

Finally, during mental model consolidation flexibility is introduced into the mental model by extending the 
functionality of the depicted system to cover a variety of operational requirements. This gives the learner the 
capacity to deal with the behavior of the system in unfamiliar circumstances, such as those encountered in 
problem solving situations. 

Although the five phases are described here sequentially, this is not meant to imply that they will necessarily 
occur in this neat linear fashion during learning. Rather, a recursive alternation between phases would typically 
be expected as the learner progressively works with the animated material. Further, the quality of the mental 
model ultimately developed would very much depend on the extent of an individual learner’s domain specific 
background knowledge regarding the depicted content. Without such knowledge, the learner may have difficulty 
in extracting the required raw material from an animation and then integrating it into a coherent and functionally 
effective whole. 

Various broad classes of cognitive - as well as visual - processes are addressed in many theoretical accounts of 
learning from textual and pictorial representations (e.g., Kintsch, 1998; Mayer, 2009; Schnotz & Bannert, 2003; 
Wittrock, 1974, 1989): (1) orientation and overview, (2) comprehensive scanning, (3) identification and selection 
of information, (4) organization of information, (5) elaboration, transformation and integration of information, 
and (6) rehearsal. The first three classes of cognitive processes are concerned with both exploring the 
information space and selecting informational units. The fourth and fifth classes address the cognitive processes 
involved in the generation of structure and meaning. The sixth class, rehearsal, focuses on cognitive processes 
that concern the remembering of information such as systematic recapitulation and mnemonic techniques. 

4. A Systematic Characterization of Cognitive Techniques for Learning from Textual and Pictorial 
Representations 

Although theoretical accounts of learning from textual and pictorial representations suggest that the classes of 
cognitive processes described above form the foundation for understanding textual and pictorial representations, 
the preceding summary of research on learning techniques and strategies indicates that these processes can be 
induced in a variety of different ways. On the basis of (a) the learning techniques and strategies reported in the 
literature, (b) the theories and models of learning from textual and pictorial representations described above, and 
(c) our own analyses, we developed a systematic characterization of learning techniques (cf. Table 1) that deals 
with all major classes of representations: learning from texts, pictures, diagrams, combinations of texts and 
graphics, and animations. The characterization brings together these different types of representations within a 
common framework in order to provide a unified approach to areas that until now have lacked cohesion. In doing 
so, it highlights significant gaps that exist in research and practice with respect to pictorial representations. 

The rows of Table 1 are organized according to the six broad classes of cognitive processes as described above. 
The columns are arranged according to the major classes of representations. Each cell shows learning techniques 
that aim to induce the corresponding cognitive process and that are applicable to the corresponding 
representation. The numbers in brackets attached to the techniques refer to research that has theoretically 
conceptualized or empirically investigated the corresponding techniques. The complete references are provided 
at the end of the paper. 

Because the main concern of this paper is with learning from pictorial representations, in Table 1 the references 
to research on techniques for learning from text are selective. We acknowledge that far more research on 
techniques for learning from text has been conducted (for overviews see Artelt, 2000; Gambrell, Morrow & 
Pressley, 2007; Mandl & Friedrich, 1992, 2006; Pressley & Harris, 2006). However, we are unaware of further 
research on cognitive techniques for learning from static and dynamic pictorial representations, i.e., research that 
has theoretically conceptualized or empirically investigated such techniques. With respect to learning material 
that comprises pictorial representations, most of the techniques that have been suggested are intended to 
facilitate learning from illustrated texts. In accordance with current theories and models of multimedia learning 
(e.g., Mayer, 2009; Schnotz, 2005), many of these techniques encourage the students to interrelate the textual 
and pictorial representations structurally as well as semantically. 
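
The row-and-column organization of Table 1 described above can be read as a lookup from a class of cognitive process and a class of representation to a list of techniques. The following is a minimal sketch of that reading, not part of the characterization itself. It transcribes only the "comprehensive scanning" row of Table 1; the names Technique, CHARACTERIZATION, techniques_for and research_gaps are hypothetical and were chosen for illustration.

```python
from __future__ import annotations

from typing import NamedTuple


class Technique(NamedTuple):
    name: str
    references: tuple[int, ...] = ()  # bracketed reference indices from Table 1


# Excerpt only: the "comprehensive scanning" row of Table 1 (illustrative transcription).
CHARACTERIZATION: dict[str, dict[str, list[Technique]]] = {
    "comprehensive scanning": {
        "text": [Technique("read systematically and completely", (1, 2, 3))],
        "picture": [Technique("scan systematically and completely")],
        "diagram": [Technique("scan systematically and completely")],
        "text-graphic combination": [
            Technique("read systematically and completely", (39, 35, 36, 37, 38)),
            Technique("scan systematically and completely"),
        ],
        "animation": [
            Technique("scan systematically and completely"),
            Technique("listen systematically and completely"),
        ],
    },
}


def techniques_for(process: str, representation: str) -> list[Technique]:
    """Return the techniques in one cell; an empty list stands for a blank cell."""
    return CHARACTERIZATION.get(process, {}).get(representation, [])


def research_gaps(process: str) -> dict[str, list[str]]:
    """Map each representation to its techniques that carry no reference index."""
    gaps: dict[str, list[str]] = {}
    for representation, cell in CHARACTERIZATION.get(process, {}).items():
        missing = [technique.name for technique in cell if not technique.references]
        if missing:
            gaps[representation] = missing
    return gaps
```

Under these assumptions, research_gaps("comprehensive scanning") lists techniques for pictures, diagrams, and animations but none for text, which mirrors the imbalance between text-related and picture-related research noted above.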

In those cases where we are unaware of research on specific techniques for learning from static and dynamic 
pictorial representations, we formulated such techniques on the basis of current theories and models of how 
learners process pictorial representations. For instance, various researchers suggest that a text should be read 
carefully and completely after overviewing it (e.g., Dansereau et al., 1997; Robinson, 1946; Thomas & Robinson, 
1972). We are not aware of research that specifies comparable techniques for learning from animations. 
However, research conducted by Lowe (1999, 2003, 2004, 2008) demonstrates that learners easily miss 
important entities in the pictorial display of an animation. Therefore, in Table 1 we formulated two techniques 
that concern the cognitive process of comprehensive scanning while learning from animations: (1) scan the 
animation’s display systematically and completely and (2) if the animation is narrated, listen to its narration 
systematically and completely. 

In developing Table 1, special emphasis was placed on abstracting generic approaches that are super-ordinate to 
the individual techniques specific to certain kinds of representation (cf. the second column in Table 1). These 
abstracted approaches lie in a conceptual space between (a) the fundamental cognitive processes they aim to 
induce, and (b) the representation-specific techniques associated with particular representation types or their 
combinations. They express the overarching purpose that is manifested in the form of different 
representation-specific techniques. 

In order to explore the potential of our characterization of cognitive learning techniques, in the next section we 
illustrate how successful strategies for learning from text as well as from illustrated text are covered by the 
characterization. We also show how the characterization can be used for constructing a new strategy for learning 
from animation. 


Table 1. Cognitive learning techniques categorized according to cognitive processes and representations: The 
numbers in brackets refer to research that has theoretically conceptualized or empirically evaluated the 
corresponding techniques 

Columns: Class of cognitive process; Techniques across representations; Techniques specific to representations 
(Text, Picture, Diagram, Text-graphic combination, Animation with or without narration) 

1. Orientation and overview 
Main purposes: overview; identify general topic; activate prior knowledge; develop focus 

Across representations: gauge scope (“fly over”) 
  • Text: skim read (2) 
  • Picture: coarsely scan 
  • Diagram: coarsely scan 
  • Text-graphic combination: skim read (35, 37, 38); coarsely scan (36, 37, 38) 
  • Animation: coarsely scan; hear through (46) 

Across representations: identify general topic (1, 2) 

Across representations: put into context 

Across representations: browse through 
  • Text: flip through 
  • Picture: browse visually 
  • Diagram: browse visually 
  • Text-graphic combination: flip through; browse visually 
  • Animation: zap; browse visually 

Across representations: process surrounding information 
  • Text: read introduction, summary (1), conclusion, headings (11, 12, 8, 9) 
  • Picture: read caption, legend, visual key, labels 
  • Diagram: read caption, legend, visual key, read labels 
  • Text-graphic combination: read introduction, summary, conclusion, headings; read caption, legend, 
    visual key, labels 
  • Animation: read description; listen to introduction, summary, conclusion; read labels 

Across representations: recall prior knowledge (2) 

Across representations: raise questions (2) 


2. Comprehensive scanning 
Main purposes: become aware of total information space; identify sub-topics; identify course of argumentation 

Across representations: process systematically and completely 
  • Text: read systematically and completely (1, 2, 3) 
  • Picture: scan systematically and completely 
  • Diagram: scan systematically and completely 
  • Text-graphic combination: read systematically and completely (39, 35, 36, 37, 38); scan systematically 
    and completely 
  • Animation: scan systematically and completely; listen systematically and completely 

3. Information identification and selection 
Main purposes: analyse; evaluate; select; record 

Across representations: establish boundaries 
  • Text: distinguish verbal units (e.g., words, phrases, statements; 2, 3, 5, 6) 
  • Picture: distinguish foreground/background, graphical units (e.g., entities, regions; 26, 27) 
  • Diagram: distinguish foreground/background, graphical units (e.g., entities, regions), symbols 
  • Text-graphic combination: distinguish verbal units (e.g., words, phrases, statements), 
    foreground/background, graphical units (e.g., entities, regions), symbols 
  • Animation: distinguish verbal units (e.g., words, phrases, statements), foreground/background, 
    graphical units (e.g., entities, regions), symbols, event units, invariants 

Across representations: evaluate and select 
  • Text: select key verbal units (1, 2, 3, 4) 
  • Picture: select key graphical units (27, 29, 30) 
  • Diagram: select key graphical units, symbols 
  • Text-graphic combination: select key verbal units, graphical units, symbols (27, 35, 37, 38, 39) 
  • Animation: select key verbal units, pictures, graphical units, symbols, event units, invariants (46, 50) 

Across representations: index and record 
  • Text: underline, highlight, mark (4, 7, 8, 9, 10, 11, 12); take notes (2, 4, 11, 12) 
  • Picture: highlight, mark (26, 27); draw sketch (26, 30) 
  • Diagram: highlight, mark; draw sketch 
  • Text-graphic combination: underline, highlight, mark verbal units, graphical units, symbols 
    (26, 36, 37, 38); take notes; draw sketch (15) 
  • Animation: take notes (46, 47, 48); draw sketches (46) 

4. Organization 
Main purposes: structure; synthesize 

Across representations: link (e.g., draw connecting lines) 
  • Text: link verbal units (3, 5, 6) 
  • Picture: link graphical units (4) 
  • Diagram: link graphical units, symbols 
  • Text-graphic combination: link verbal units, graphical units, symbols (27, 36, 37, 38, 39, 40, 41, 42) 
  • Animation: link verbal units, graphical units, symbols, event units, invariants (50) 

Across representations: group (e.g., circle, box, color code) 
  • Text: group verbal units (3, 5, 6) 
  • Picture: group graphical units (35) 
  • Diagram: group graphical units, symbols 
  • Text-graphic combination: group verbal units, graphical units, symbols 
  • Animation: group verbal units, graphical units, symbols, event units, invariants 

Across representations: rearrange (e.g., organize into sequence, table, hierarchy) 
  • Text: rearrange verbal units (5, 6) 
  • Picture: rearrange graphical units 
  • Diagram: rearrange graphical units 
  • Text-graphic combination: rearrange verbal units, graphical units, symbols (35) 
  • Animation: rearrange verbal units, graphical units, symbols, event units, invariants (49) 

Across representations: create higher order units 
  • Text: establish sections (3); introduce headings (4) 
  • Picture: establish regions (35); introduce labels (4, 15, 26, 31, 33, 32) 
  • Diagram: establish regions; introduce labels 
  • Text-graphic combination: establish sections (35); introduce headings; establish regions; 
    introduce labels (15, 31) 
  • Animation: establish scenes; establish regions in pictures; establish sections in narration; 
    introduce headings; introduce labels (46) 

5. Integration, transformation, and elaboration 
Main purposes: interpret; abstract; exemplify; justify; appropriate 

Across representations: denote 
  • Text: denote verbal units and relations between them (2, 5, 6) 
  • Picture: denote graphical units and relations between them (26) 
  • Diagram: denote graphical units, symbols and relations between them 
  • Text-graphic combination: denote verbal units and relations between them (39, 41, 44); denote graphical 
    units, symbols and relations between them (39, 41, 44); denote relations between verbal units, 
    graphical units and symbols (39, 31, 41, 44) 
  • Animation: denote graphical units, scenes and relations between them; denote verbal units and relations 
    between them; denote relations between verbal units, graphical units and scenes (46) 

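Across representations: abstract 

Across representations: exemplify (2) 
  • Text: find examples (2) 

Across representations: self-explain 
  • Text: explain text to yourself (3, 13) 
  • Diagram: (13) 
  • Animation: (51) 

Across representations: relate to prior knowledge (2) 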

Across representations: appropriate 
  • Text: state in own words (1, 3); visualize (8, 9, 11, 12, 14, 15, 16, 18, 20, 21, 22); write summary 
    (2, 4, 5, 6, 8, 9, 11, 12) 
  • Picture: draw own (e.g., abstract) picture (33, 32); verbalize (26); write summary 
  • Diagram: draw own (e.g., enriched) diagram (33, 34); verbalize; write summary 
  • Text-graphic combination: state text in own words; visualize text (20, 35, 36, 3, 37, 38, 43, 45); 
    verbalize picture (35, 37, 38, 39, 45); write summary (35, 37, 38) 
  • Animation: draw own pictures, scenes; verbalize pictures, scenes; state narration in own words; 
    write summary (46); visualize (52) 

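6. Rehearsal 
Main purposes: remember 

Across representations: re-process comprehensively 
  • Text: re-read (1) 
  • Picture: re-watch 
  • Diagram: re-watch 
  • Text-graphic combination: re-read; re-watch 
  • Animation: re-watch; re-listen 

Across representations: re-process coarsely 
  • Text: skim read 
  • Picture: coarsely scan 
  • Diagram: coarsely scan 
  • Text-graphic combination: skim read; coarsely scan 
  • Animation: coarsely scan; hear through 

Across representations: re-read own notes 

Across representations: re-view own drawings 

Across representations: apply mnemonic techniques (23, 24, 25) 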

5. Aligning Strategies with and Constructing Strategies from the Systematic Characterization 

In this section, we first show how two existing strategies fit into the systematic characterization: (1) the 
PQ4R-Method of Thomas and Robinson (1972) for learning from text and (2) the strategy for learning from 
illustrated text proposed by Schlag and Ploetzner (2009, 2011). Although the learning techniques employed in 
both strategies are already mentioned in Table 1 (cf. the index 2 for Thomas & Robinson, 1972, and the indexes 
37 and 38 for Schlag & Ploetzner, 2009, 2011), the table presents this information in a rather distributed way. 
Therefore, in Table 2 the component techniques of the selected strategies as well as their associated classes of 
cognitive processes are treated in a more compact way. Next, we exemplify how the systematic characterization 
of cognitive learning techniques can be aligned with the Animation Processing Model of Lowe and Boucheix (2008) 
to generate a new strategy for learning from animation. 

Table 2. How two existing learning strategies fit into the systematic characterization of cognitive learning 
techniques and what a new learning strategy could look like 


Columns: Class of cognitive process; Text: Thomas and Robinson (1972, PQ4R); Illustrated text: Schlag and 
Ploetzner (2009, 2011); Animation without narration: based on Lowe and Boucheix (2008, 2011) 

1. Orientation and overview 
  • Text: skim read; identify general topic; recall prior knowledge; raise questions 
  • Illustrated text: skim read; coarsely scan 
  • Animation without narration: (blank) 

2. Comprehensive scanning 
  • Text: read systematically and completely 
  • Illustrated text: read systematically and completely 
  • Animation without narration: scan systematically and completely 

3. Information identification and selection 
  • Text: distinguish verbal units; select key verbal units; take notes 
  • Illustrated text: select key verbal units; select key graphical units; underline, highlight, mark key 
    verbal units; highlight, mark key graphical units 
  • Animation without narration: distinguish entities; distinguish event units 

4. Organization 
  • Text: (blank) 
  • Illustrated text: link verbal and graphical units 
  • Animation without narration: group event units 

5. Integration, transformation and elaboration 
  • Text: denote verbal units and relations between them; find examples; relate to prior knowledge; 
    state in own words; write summary 
  • Illustrated text: visualize text; verbalize picture; write summary 
  • Animation without narration: denote relations between event units; self-explain; write summary 

6. Rehearsal 
  • Text: (blank) 
  • Illustrated text: (blank) 
  • Animation without narration: (blank) 


Thomas and Robinson’s (1972) PQ4R-Method (Preview, Question, Read, Reflect, Recite, Review) structures 
learning from text by means of six steps. In the first step (Preview), the material is surveyed to get a general 
overview. Previewing takes advantage of various learning techniques such as skimming the text, identifying the 
general topic, and activating prior knowledge. In the second step (Question), questions about the different 
sections of the text are formulated. In the third step (Read), the text is read systematically and completely. While 
reading the text thoroughly, learning techniques such as identifying key words and key phrases as well as taking 
notes are utilized. This step can both lead to answers to some of the previously raised questions and give rise to 
new ones. Step four (Reflect) and step five (Recite) target the construction of meaning. Within these steps, 
learning techniques such as establishing relations between units of presented information as well as between 
units of presented information and prior knowledge are employed. Step six (Review) serves to evaluate and to 
foster the understanding gained. It encourages the formulation of an overall summary as well as the provision of 
answers to the questions previously raised. The text is re-read only if necessary. 

Although the PQ4R-Method does not employ techniques that explicitly encourage processes of information 
organization as conceptualized in Table 1, it nevertheless becomes evident from Table 2 that the PQ4R-Method 
focuses on encouraging a deep level approach to learning (cf. Marton & Säljö, 1984). Techniques targeting the 
rehearsal of information are deliberately avoided in this method. 
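
The coverage argument above can be made explicit by checking which classes of cognitive processes a strategy leaves without any technique. The following is a minimal sketch, assuming the Text column of Table 2 is transcribed as shown; the names PROCESS_CLASSES, PQ4R and uncovered_classes are hypothetical and not part of the PQ4R-Method.

```python
# The six classes of cognitive processes, in the order used in Tables 1 and 2.
PROCESS_CLASSES = [
    "orientation and overview",
    "comprehensive scanning",
    "information identification and selection",
    "organization",
    "integration, transformation, and elaboration",
    "rehearsal",
]

# Illustrative transcription of the Text column of Table 2 (PQ4R-Method).
# Classes without an entry correspond to blank cells.
PQ4R = {
    "orientation and overview": [
        "skim read",
        "identify general topic",
        "recall prior knowledge",
        "raise questions",
    ],
    "comprehensive scanning": ["read systematically and completely"],
    "information identification and selection": [
        "distinguish verbal units",
        "select key verbal units",
        "take notes",
    ],
    "integration, transformation, and elaboration": [
        "denote verbal units and relations between them",
        "find examples",
        "relate to prior knowledge",
        "state in own words",
        "write summary",
    ],
}


def uncovered_classes(strategy):
    """Return the process classes for which the strategy lists no technique."""
    return [cls for cls in PROCESS_CLASSES if not strategy.get(cls)]
```

With this transcription, uncovered_classes(PQ4R) yields the organization and rehearsal classes, in line with the observation that the method neither explicitly encourages information organization nor targets rehearsal.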

On the basis of current theories of multimedia learning (cf. Mayer, 2009; Schnotz & Bannert, 2003), Schlag and 
Ploetzner (2009, 2011) proposed and empirically evaluated a four-step strategy for learning from illustrated text. 
The aim of the first step is to provide a general overview. It employs the learning techniques of skim reading the 
text and coarsely scanning the illustrations. The second step focuses on establishing relations between the text 
and the illustrations. It encourages students to read the text thoroughly, to highlight key words and key phrases in 
the text, to highlight corresponding entities in the illustrations, and - as far as possible - to label the graphical 
entities by using the key words and key phrases thereby linking both representations. In the third step, the 
students are asked to describe the identified relations between the text and the illustrations in their own words by 
writing a summary. The fourth step also encourages students to summarize the most important information 
visually by drawing a sketch of their own. 

As with the PQ4R-Method of Thomas and Robinson (1972), the learning strategy proposed by Schlag and 
Ploetzner (2009, 2011) promotes a deep level approach to learning. While it focuses on establishing meaningful 
relations between text and illustrations, it does not put the same emphasis on the processing of the individual 
representations themselves. With respect to the processing of text, the strategy could be made more 
comprehensive by complementing it with techniques taken from established strategies for learning from text 
such as the PQ4R-Method. Regarding the processing of illustrations, however, we are not aware of learning 
strategies from which complementary techniques could be drawn. This lack of research on techniques and 
strategies for learning from pictorial representations becomes even more evident if learning from dynamic 
representations such as animations is considered. 

Kombartzky et al. (2010) proposed and evaluated a strategy for learning from narrated animations. As is the case 
with the strategy for learning from illustrated texts by Schlag and Ploetzner (2009, 2011), this narrated animation 
strategy was aligned with current theories of multimedia learning (Mayer, 2009; Schnotz & Bannert, 2003). 
Because we are not aware of any strategy for learning from animations without narration, we chose this type of 
representation to exemplify how the systematic characterization of learning techniques could be used to generate 
a new strategy. We chose the Animation Processing Model (APM) by Lowe and Boucheix (2008, 2011) as our 
theoretical basis because current theories of multimedia learning do not provide a detailed account of how 
learners process the spatial and temporal characteristics of animations. 

Based on the systematic characterization of cognitive learning techniques, Table 2 shows an example of a 
sequence of techniques that aim to induce the processes posited in the APM. Initially, the students are 
encouraged to scan the animation systematically and completely thereby identifying local events (localized 
perceptual exploration). Thereafter, the students are asked to group adjacent local events (regional structure 
formation). Next, the students need to establish causal relations between groups of events (global 
characterization). Afterwards the students are prompted to explain the functional purpose of each causal relation 
(functional differentiation). Finally, the students are asked to summarize the rules that underlie the displayed 
processes (mental model consolidation). 
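
The sequence just described can be written down as an ordered list that pairs each APM phase with the prompt given to students. This is only an illustrative sketch: the prompt wording paraphrases the paragraph above, and the names APM_STRATEGY and study_prompts are hypothetical.

```python
# Each entry pairs an APM phase with the prompt that aims to induce it.
# The order of the list is the order in which students work through the strategy.
APM_STRATEGY = [
    ("localized perceptual exploration",
     "Scan the animation systematically and completely and identify local events."),
    ("regional structure formation",
     "Group adjacent local events."),
    ("global characterization",
     "Establish causal relations between groups of events."),
    ("functional differentiation",
     "Explain the functional purpose of each causal relation."),
    ("mental model consolidation",
     "Summarize the rules that underlie the displayed processes."),
]


def study_prompts(strategy):
    """Return numbered prompts, each tagged with the phase it targets."""
    return [
        f"Step {number}: {prompt} [{phase}]"
        for number, (phase, prompt) in enumerate(strategy, start=1)
    ]
```

Printing the result of study_prompts(APM_STRATEGY) line by line would give a five-step worksheet; how such prompts are best presented to students is a design decision that the sketch leaves open.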

Once more, Table 2 reveals that the proposed strategy encourages a deep level approach to learning. Because the 
strategy is made up of only six learning techniques, it is a promising candidate for successful pedagogical 
intervention from a practical point of view. That is, teachers would have to describe, explain, and demonstrate 
only a few techniques, and students would have to internalize and exercise only a small number of techniques. 
However, whether the strategy actually benefits learning is of course an open question. To answer this question, 
empirical research is required. 

6. Discussion 

This paper has presented a systematic characterization of cognitive techniques for learning from textual and 
pictorial representations. It aims to bring a measure of coherence to research that is currently distinguished by 
fragmentation, inconsistencies, and omissions. We showed that the characterization is consistent with established 
strategies for learning from either a single type of representation (e.g., text) or combined representations (e.g., 
illustrated texts). The characterization was also used to demonstrate how new learning strategies can be 
generated from theoretical accounts of how a particular type of representation is processed (e.g., un-narrated 
animation). 

Our analyses highlight deficits in current research on learning, especially with regard to pictorial representations. 
While in recent years much research has been conducted on improving the design of static and dynamic pictorial 
representations, this has not been matched by a corresponding level of investigation into cognitive techniques 
and strategies for learning from static and dynamic pictorial representations. However, well-designed pictorial 
representations of themselves are no guarantee of successful learning. Further, students will not always be 
presented with pictorial representations that are designed in an optimal way. Without suitable cognitive 
techniques and strategies for handling these representations effectively, learning is likely to be compromised. 

In the light of rapid growth in the quantity and variety of static and dynamic pictorial representations in learning 
material, far more research on cognitive techniques and strategies for learning from pictorial representations is 
required. Such research needs to encompass (a) further investigations of how static and dynamic representations 
are processed by students, (b) theory-driven conceptualization and empirical evaluation of techniques and 
strategies for learning from pictorial representations, and (c) the practical integration of empirically validated 
techniques and strategies into teaching practices. 

Current theories and models of how static and dynamic pictorial representations are processed by students (for 
an overview see Anglin, Vaez & Cunningham, 2003) form a promising starting point for conceptualizing 
learning techniques and strategies. For example, as described earlier, the APM (Lowe & Boucheix, 2008) 
characterizes learning from animation in terms of five processing phases with a progression from bottom-up to 
top-down processes: (1) localized perceptual exploration, (2) regional structure formation, (3) global 
characterization, (4) functional differentiation, and (5) mental model consolidation. 

Unlike other theories and models that deal with learning from pictorial representations such as Richard Mayer’s 
Cognitive Theory of Multimedia Learning (Mayer, 2005b, 2009), the APM explicitly takes account of properties 
that are specific to dynamic pictorial representations such as animations. However, whereas the APM constitutes 
a theoretical abstraction, a learning strategy corresponds to a specific pedagogical intervention. That is, the 
generation of a learning strategy requires the formulation of a particular sequence of distinctive techniques that 
students actually can apply either mentally or externally. Because a theoretical abstraction is always 
underdetermined with respect to a specific pedagogical intervention, the APM - like any other model of learning 
- cannot directly be mapped to a learning strategy. Therefore, as in the research conducted by Thomas and 
Robinson (1972) as well as by Schlag and Ploetzner (2009, 2011), learning techniques need to be specified that 
are suitable to induce the cognitive processes considered to be relevant in the underlying process model. 

As demonstrated by the gaps for the un-narrated animation example in Table 2, the characterization can highlight 
interesting possibilities that have not previously been considered. For example, it may be that the blank 
‘Orientation and overview’ cell for this type of representation suggests the APM could be elaborated to cover 
cases in which the learner is allowed more than a single pass through an animation. Further, deficiencies such as 
the present lack of techniques for easy learner identification and selection of graphic units within an animation 
could be addressed using technological innovations such as touch tablets. One possibility here would be for a 
learner to tag a target entity by briefly touching it and for the tagged entity to then remain highlighted throughout 
the animation’s time course. Singling out two or more entities in this way would allow not only the entities to be 
followed more readily, but also their relationships over time. Given that most existing learning techniques that 
address materials that contain pictures target relations between different types of representations, perhaps more 
techniques are needed that help learners process relations within a single representation, such as an un-narrated 
animation. 

Many of the techniques that comprise text processing strategies were derived from the findings of research 
comparing successful and less successful readers. The present relative lack of techniques for learning from static 
pictures and particularly animations could perhaps be addressed by building up a corresponding body of 
comparative research for these representations. However, the way animations are presented must also give 
learners the opportunity to apply effective techniques. As noted by Winn (1993), attentive processing takes 
considerably longer than pre-attentive processing and the transitory nature of animation may preclude 
application of such techniques if learners are allowed only a single exposure to the available information. 

In the longer term, research needs to do more than provide conceptualizations and evaluations of techniques and 
strategies for learning from different pictorial representations. These techniques and strategies also need to be 
structured into coherent approaches to learning. Which techniques apply to a range of representations and which 
techniques are specific to certain representations? To what extent is it possible to order learning techniques 
sequentially and hierarchically? 

Besides revealing that cognitive techniques for learning from pictorial representations require more research, the 
systematic characterization of learning techniques provided in Table 1 is an important first step in the 
construction of such a structure. The characterization describes which learning techniques might be recruited to 
induce specific cognitive processes when learning from particular representations. Because the characterization 
is systematic, it helps to identify learning techniques that can be applied to a range of representations as well as 
learning techniques that are specific to particular representations. Furthermore, the characterization supports the 
construction of new learning strategies that combine different learning techniques across the classes of cognitive 
processes. 

It is when students are equipped with a structured inventory of learning techniques that learning from textual and 
pictorial representations can become truly strategic. To be successful learners in the world of increasingly 
diverse representations, they must be able to identify relevant properties of representations and to appropriately 
select and combine techniques that take these properties into account during learning. 